Command Palette

Search for a command to run...

PodMine

Jan Stoikka

Matches in: Person
1 episode
Jan 22, 2026• a16z Podcast

Inferact: Building the Infrastructure That Runs Modern AI

Woosuk and Simon from UC Berkeley discuss their open-source inference engine VLLM and their new company Inferact, which aims to build a universal infrastructure layer for running AI models efficiently across different hardware and model architectures.

43:37